# 300M parameter ViT
Webssl Dino300m Full2b 224
A 224-resolution Vision Transformer model based on 2 billion MetaCLIP data, trained using DINOv2 self-supervised learning method
Image Classification
Transformers

W
facebook
503
7
Sapiens Pretrain 0.3b
Sapiens is a vision Transformer model pretrained on 300 million high-resolution human images, specifically designed for human-centric vision tasks.
Image Classification English
S
facebook
34
1
Featured Recommended AI Models